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ABSTRACT. We prove a non-homogeneous Tl theorem for certain bi-parameter 
singular integral operators. Moreover, we discuss the related non-homogeneous 
Journe's lemma and product BMO theory. 
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1. Introduction 

This paper deals with singular integrals with two attributes: 'bi-parameter' 
and 'non-homogeneous'. Both of them have been investigated before but, as 
far as we know, only one at a time. Here, for the first time, we work in the 
simultaneous presence of both complications. 

To be more precise, we study bi-parameter singular integrals T acting on some 
class of functions with product domain R n+m = W 1 x M m . However, instead of the 
Lebesgue measure we equip W l+m with a product measure yt, — pL n x /x m , where 
the measures fi n and /i m are only assumed to be upper doubling (a condition that 
is more general than the assumption [i n (B(x,r)) < Cr s , fi m (B(x,r)) < Cr l ). We 
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establish a Tl theorem, i.e. a boundedness criterion for singular integral opera- 
tors, in this setting. 

Such a result touches on quite a few topics. We now lay down some of the con- 
text. After the classical Tl and Tb type theory by David and Journe [4] and David, 
Journe and Semmes 13, the first Tl type theorem for product spaces was proved 
by Journe [14]. Journe formulated the assumptions in the language of vector- 
valued Calderon-Zygmund theory. Recently, Journe's result was challenged by 
a new product-space Tl theorem by S. Pott and P. Villarroya 112011 , who introduced 
an alternative framework avoiding the vector-valued assumptions. Following a 
similar philosophy, the latter author proved a bi-parameter analog [16] of the rep- 
resentation theorem of the first author Q - a particular consequence of which is 
yet another bi-parameter Tl theorem of a more dyadic flavour than ||20| . 

All the classical multiparameter methods including Journe's covering lemma 
lfj~3l and the product BMO, Hardy space and multiparameter singular integral 
theory by Chang and Fefferman JT], (2), Fefferman Q and Fefferman and Stein 
|7| are in the doubling situation. The same is true for the results of the previ- 
ous paragraph, and for the other related modern developments like 13, Ifl5i |2~T| 
and |[22||. These deal with variations of Journe's covering lemma, multiparameter 
paraproducts and dyadic versions of multiparameter function spaces. Since all 
the relevant results and definitions, both classical and modern, are in the dou- 
bling situation, we have to consider Journe's lemma, (dyadic) product BMO, H 1 - 
BMO type duality results, singular integrals and multiparameter paraproducts 
all from the new perspective of general measures. 

The following explains the specific need for these types of results. Even in 
the one-parameter theory of non-homogeneous singular integrals, one needs to 
replace the familiar BMO condition by 

f\f-h\ p d^<Lii{Kl), 

where a parameter n > 1 specifies an expansion nl of the cube 7; this can have 
a much larger measure than / when fi is non-doubling. Accordingly, we need to 
define and study a new space BMO pro d(yu), which is used in the formulation of 
our main theorem. To show that the new condition Tl G BMO pro d(/i) is necessary, 
we need a version of Journe's covering lemma for general product measures. 
Finally, to handle some mixed paraproducts, we need a certain new if 1 -BMO 
type duality inequality. 

Our basic proof strategy of the Tl theorem uses dyadic probabilistic techniques 
adapted to the bi-parameter situation. These were already used by the latter au- 
thor in fl6l but, again, only in the doubling case. The dyadic and probabilistic 
methods of non-homogeneous analysis were pioneered by Nazarov, Treil and 
Volberg (see e.g. the non-homogeneous Tb theorems [|18| and I1T91P . However, 
these powerful tools are not widely used in multiparameter harmonic analysis, 
and some extra care is needed. In this regard the current paper is a continuation 
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of the recent developments (see e.g. 00, ffl, OH, ED, OH, El) in the probabilis- 
tic methods. 

The proof starts by expanding in a product Haar basis adapted to the general 
measures (j, n and fi m . The summation in the bi-parameter case is rather massive 
and the non-homogeneous techniques are needed in various parts of the summa- 
tion. All kinds of bad-boundary terms appear - also in some new mixed situa- 
tions. Moreover, there is a wide variety of new non-homogeneous paraproducts, 
some of them related to the new space BMO pro d(/i). 

Just like in ||T6l our operators are defined using the philosophy of Pott and Vil- 
larroya |20|. That is, we mostly avoid the language of vector-valued formulations 
used in the original work of Journe |[T4|. Of course, our kernel estimates are tied 
to the non-homogeneous measures. 

2. THE MAIN THEOREM 

We use this section to introduce the (somewhat lengthy) framework necessary 
for the formulation of our main theorem. We consider the following class of 
measures: 

2.1. Definition (Upper doubling measures). Let A: M. n x (0, oo) — > (0, oo) be a 
function so that r H- X(x,r) is non-decreasing and X(x, 2r) < C\X(x,r) for all 
x E R n and r > 0. We say that a Borel measure \l in W 1 is upper doubling with 
the dominating function A, if fj,(B(x, r)) < A(x, r) for all x E IR n and r > 0. We set 
d x = log 2 C x . 

The property X(x, \x — y\) ~ X(y,\x — y\) would be convenient. This is luck- 
ily something that can be arranged for free. In |fl2l Proposition 1.1] it is shown 
that A(x, r) := inf 2effi n X(z, r + \x — z\) satisfies that r H- A(x, r) is non-decreasing, 
A(x, 2r) < CaA(x, r), n(B(x, r)) < A(x, r), A(x, r) < X(x, r) and A(x, r) < C\A(y, r) 
if \x — y\ < r. Therefore, we may (and do) always assume that dominating func- 
tions A satisfy the additional symmetry property X(x, r) < CX(y, r) if \x — y\ < r. 

Henceforth, let /i = jjL n x [i m , where fi n and fx m are upper doubling measures on 
W 1 and M m respectively. The corresponding dominating functions are denoted by 
X n and X m . We use i°° metrics on R n and R m . 

2.1. Bi-parameter Calderon-Zygmund operators. We study an a priori bounded 
linear operator T: L 2 (fi) — > L 2 (fi). In addition to the usual adjoint T*, we will be 
concerned with the partial adjoint T x defined by 

{T l {f l ®f 2 ),g l ®g 2 ) = (T( gi ®f 2 )Ji®g 2 ). 

The philosophy of the following assumptions is that we impose symmetric con- 
ditions on all the four operators T, T* , T x and T*. 

2.2. Assumption (Full kernel representation). If / = fi <8> f 2 and g = g\ ® g 2 with 

fx, gi : M n -> C, f 2 , g 2 :R m ^ C, spt fi fi spt g x = and spt / 2 fi spt ^ 2 = 0, we have 
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the kernel representation 

(Tf,g}= / K(x,y)f(y)g(x)dfx(x)dfi(y) 

= // // K(x 1 ,x 2 ;yi,y2)f(yi,y2)9(xi,X2)dfi(x 1 ,x 2 )dii(yi,y2), 

where the kernel K is a function 

K: (R n+m x R n+m ) \ {(x, y) e R n+m x R n+rn : x x = yi or x 2 = y 2 } -> C. 

Note that this implies kernel representations for T* , T\ and T*. If we denote 
their kernels respectively by K* , K\ and if*, then we immediately observe the 
formulae 

K*(x,y) = K(y 1 ,y 2 ;x 1 ,x 2 ), 
Ki(x,y) = K(y 1 ,x 2 ;x 1 ,y 2 ), 
K*(x,y) = K(x 1 ,y 2 ;y 1 ,x 2 ). 

2.3. Assumption (Full standard estimates). The kernel is assumed to satisfy the 
size condition 

\K(x,y)\ < C— p - — p -, 

A n (a;i, \xx - y^) X m {x 2 , \x 2 - y 2 \) 

the Holder condition 

\K(x, y) - K(x,( yi , y' 2 )) - K(x, (y[, y 2 )) + K(x, y') \ 

< C- 
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whenever \y\ — y[\ < \x± — yi\/2 and \y 2 — y' 2 \ < \x 2 — y 2 \/2, and the mixed Holder 
and size condition 

\K(x,y)-K(x, (y' 1 ,y2))\<C 

whenever \yi — y[\ < \x± — yi\/2. The same conditions are imposed on K*, K\ and 
K* as well. 

We need to assume some Calderon-Zygmund structure on R™ and R m sepa- 
rately. 

2.4. Assumption (Partial kernel representation). If / = fi <g) J 2 and g = g\ ® g 2 
with spt fi fl spt g\ = 0, then we assume the kernel representation 



{Tf,g}= / AT/ 2iff2 (xi,yi)/i(2/i)^i(xi)d/x n (xi)d// n (j/i). 
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2.5. Assumption (Partial boundedness x standard estimates). The kernel 

K hm . {W 1 x W 1 ) \ {(x 1 ,y 1 ) £ l n x I™ : xi = Vl } 
is assumed to satisfy the size condition 

and the Holder conditions 

\ K h, 9 2( x i,yi) - ^/ 2 ,g 2 K,?/i)| < c(f 2 ,g 2 ) 

whenever \x\ — x[\ < \x\ — yi\/2, and 

\ K h, 92 (xi,yi) -K h , g2 {x u y[)\ < C(f 2 ,g 2 ) 
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whenever \yi — y[\ < \x\ — yi\/2. 
We assume that C{f 2 , g 2 ) < IIMIl^^)!!^!!^^)- 

We assume the analogous representation and properties with a kernel Kf ugi in 
the case spt/ 2 H sptg 2 = 0. 

2.6. Remark. As is clear, the final bound is not allowed to depend on \\T\\ L 2^^ L 2^y 
However, note that we do assume some type of L 2 boundedness separately on 
(M n ,/i n ) and (M m ,/i m ) (the bounds for C(fi,gi) and C(f 2 ,g 2 )). This is in slight 
contrast with some of the most recent works in the homogeneous bi-parameter 
setting ill6| . There one may work with a bit weaker testing conditions for C(-, ■). 
However, this seems to be a standard assumption in the classical homogeneous 
works, and we also require it in our non-homogeneous setting. 

2.7. Assumption (Full weak boundedness property). We assume the following 
weak boundedness property: for every cube / C IR n and J C M m there holds that 

I (T(xi ® Xj), Xi ®Xj)I < C/i n (5/) y u m (5 J). 

Note that this condition is invariant with respect to the replacement of T by 
either T* , T x or T*. 

Moreover, we need some type of diagonal BMO assumptions. This amounts to 
replacing exactly one of the indicator functions above by a cancellative function 
supported on the same cube. More precisely: 

2.8. Assumption (Partial weak boundedness x BMO condition). Let I cW 1 and 
J C W a be cubes and a If aj be functions such that J aj dfi n = J ajd[i m = 0, 
spta/ C / and sptaj C J. We assume that 

\{T{xi®Xj),ai®Xj)\ + \ {T*{xi®Xj),ai®Xj)\ < C\\ 
and 

\(T(xi ® Xj), Xi ® aj)\ + \(T*( X i ® Xj), Xi ® < C/i n (5I)\\ (5J)^. 
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Note that no new conditions arise by replacing T by T\ above. The reason for 
calling this a BMO condition will become more clear in the course of the proof. 
Observe that both the full weak boundedness and the partial weak boundedness 
x BMO condition are obviously necessary conditions for the boundedness of T 
on L 2 (n). In fact, they would both follow from the following stronger diagonal 
testing condition 

\\Xi ® XjT(xi ® Xj)\\l^) + \\xi ® XjT*(xi ® Xj)\\z>M < C/i n ,(5J) 1/2 /i m (5 J) 1/2 . 

2.2. Dyadic grids and Haar functions. In order to formulate the actual "T\ G 
BMO" conditions in the present setting, it is is necessary to introduce some nota- 
tion related to dyadic martingale difference decompositions and Haar functions. 
Let V® and be the standard dyadic grids on IR n and R m respectively. 

2.2.1. Random dyadic grids and good/bad cubes. Weletw; n = {w nti ) i& ,w' n = (w' ni ) ieZ , 
w m = (w m j) jeZ and w' m = (w' mJ ) j& , where w n ^w' ni G {0, l} n and w m j,w' mj G 
{0, In W l we define the new dyadic grid V n = {I + 2-(<<m 2~ l w nji '■ 
I G V° n } = {I + w n : I E T>° n }, where we simply have defined I + w n : = 
/ + Y^i- 2-* <£{i) ^~ tyj n,i- The grids V' n , V m and V m are similarly defined. There 
is a natural product probability structure on ({0, l} n ) z and ({0, l} m ) z . Even if 
n = m and despite the notation we agree that we have four independent random 
dyadic grids. 

A cube h G V n is called bad if there exists J 2 G £>; so that £(I 2 ) > 2 r i(h) 
and d(h,dl 2 ) < Ai(h)^ i{I 2 f^ . Here £(J) denotes the side length of I and 
r ) n = aj (2d\ n + 2a), where a > appears in the kernel estimates and, we recall, 
d\ n = log 2 C\ n . The parameter r > will be fixed later. Note also that here bad 
really means T>' n -bad, but this is not usually spelled out. The badness in R m is 
defined similarly (it involves the same parameter r and 7 m = (5/ (2d\ m + 2/3)). 

It is easy to check that, almost surely with respect to the canonical probabil- 
ity on ({0, l} n ) z , a random dyadic system V n has the following "no quadrants" 
property: Whenever I k G V n , k G N, is a strictly increasing sequence (meaning 
that h C ifc+i) of dyadic cubes, then this sequence exhausts all of IR n (meaning 
that UfceN Ik = ^ n )- By throwing away a subset of the probability space with 
probability zero, we can and will assume that all our dyadic systems possess this 
additional "no quadrants" property. This convention is a matter of convenience, 
which has no effect on any of the probabilistic statements that we are going to 
make. 

2.2.2. Haar functions on M. n with general measures. The content of this subsection is 
from IfTOH . Let V n be one of the dyadic grids on M™ as above and \i n be any locally 
finite Borel measure on W 1 . We set 

Eif = (f)iXi, 

A// = [(f)p - (f)i]xi>, 

/'ech(J) 
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where ch(J) = {Ij : j — 1, . . . , 2 n } is the collection of dyadic children of /, and 
(/)/ = ^n{I)~ l fj f dfi n . For any f 6 Z and / G L 2 (fi n ) we have the orthogonal 
decomposition 

/= E if+ E 

«(I)=2< £(J)<2* 

We index the children Ij of 7 in such a way that 

2 n 

Mn(4) > [1 - (fc - l)2" n ]/i n (J), 4 = |J I u . 

u=k 

Hence yu n (/ fc ) ~ jt/ n (I) for every A;. We then continue to decompose 

Eif = (f,h It0 )h Ii0 , 

2 n -l 

A// = ^2{f,h I>v )h I>v , 

T,=l 

where h I:0 = {i n {I)~ x l 2 xi and 



hi. 



/i„(4)Ai n (4+i) V /2 



2 n - 1. 



If /Xn(4) = 0, we set = 0. 

We have the orthonormality (hi uVl ,hj 2:ri2 } = 5^5^. Moreover, for rj ^ we 
have cancellation: J /i/ ir) <i/i n = 0. Finally, h ItV is supported on / and constant on 
its children. We have established that {h Ifi : £(/) = 2 e } U : £(I) <2 e ,7]e 
{1, . . . , 2 n — 1}} is an orthonormal basis for L 2 (/i„) with the above mentioned 
properties. 

Suppose now that = x [i m , where /i n and /i m are locally finite Borel mea- 
sures on M n and IR m respectively. Assume also that we are given a dyadic grid V n 
on R n and a dyadic grid V m on W 71 . We denote the Haar functions on W 1 with re- 
spect to the measure [i n by h IjV , I e V n , r\ = 0, . . . , 2 n — 1, and we denote the Haar 
functions on IR m with respect to the measure /i m by u J>K , J e P m/ k = 0, . . . , 2 m — 1. 

2.3. BMO conditions. We recall the definition of the BMO space relevant in the 
upper doubling setting from IIT9l[TT| . 

2.9. Definition. We say that / e L\ oc (^ n ) belongs to BMO^(/i n ), if for any cube 
I CW 1 there exists a constant // such that 



[f\f-h\ p d^ llP <L^ n {Kl) 



i/p 



where the constant L does not depend on /. The best constant L is denoted 

H/||bMO£ (flu)- 

The following lemma from IIT91ITT1I motivates our definition of the BMO space 
for a product measure fx n x /i m : 
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2.10. Lemma. For all J G V' n , 

( E I(/.^)I 2 ) 1/2 ^^(j) 1/2 ii/Hbmo^(m- 

IeV n good 

icJ 
e(i)<2~ r e(j) 

2.11. Definition. We say that b G BMO prod (/i), if for every V = V n x V m and 
V = V' n x V' m there holds that 

(EEE E k^®^>i 2 ) 1/2 < Lm 1 ' 2 

v =l k=1 SeV R=Ix JeOgood, RcS 
Sett gen(i?)=gen(5)+(r,r) 

for all sets f2 c ]R n+m such that < oo and such that for every x G f2 there 
exists S G £>' so that i G 5 C n. The best constant L is denoted ||&||BMO prod (M)- Here 

gen(R) := (gen(7), gen(J)), 

and gen(/) is the usual generation of a dyadic cube (if = 2~ k then gen(J) = k). 
Moreover, the fact that R is good means that / is £>' t -good and J is P^-good. 

2.12. Remark. In the definition of BMO pro d(/i), we can equivalently restrict to 
considering sets f2 which, in addition to the stated property, are also bounded. 
Namely, if the defining inequality holds for this more restricted class of sets, and 
f2 is any set as in Definition 12.111 then we simply check the restricted product 
BMO condition for bounded sets of the form f2 D S^, where Sk is the union of the 
2 n+m dyadic cubes in of sidelength 2 fc that lie closest to the origin. We conclude 
that the square sum over S C Q H Sk is dominated by Lfi(Q n Sk) 1 / 2 < Lfi(Q) 1 ^ 2 , 
and taking the limit as k — > oo, we find that the full sum over S C f2 is also 
dominated by LiiiVt) 1 / 2 , as required in Definition 12.1 1[ 

We can now formulate our main theorem: 

2.13. Theorem. Let /i = /i n , x /i m , where fi n and \i m are upper doubling measures on M" 
and M m respectively. Let T: L 2 (/j,) — > L 2 (n) be a bi-parameter Colder vn-Zygmund op- 
erator, which is a priori bounded. Assume that T satisfies all the Assumptions formulated 
in this section. Moreover, assume that 

SI e BMO prod (/i) VS G {T, T*, Ti, T*}. 

Then there holds that \\T\\ < 1, a bound depending only on the Assumptions and the 
BMO pro d(/i) norms of the four SI, but independent of the a priori bound. 

2.14. Remark. Note that at least for a large subclass of operators (see Section[8]) our 
conditions, except for the conditions Ti(l),T*(l) G BMO pro d(A0/ are necessary. 
Indeed, Ti does not have to be bounded even if T is. This deficit is shared by all 
known bi-parameter TI theorems, already in the homogeneous setting. 



non-homogeneous tl theorem for bi-parameter singular integrals 9 

3. Strategy of the proof 

3.1. Initial reductions. The following reduction is quite standard, but we do 
it carefully, since in the bi-parameter case the details are not written anywhere 
else. We do not need an exact averaging equality as in some of the latest non- 
homogeneous work - the by now classical trick of Nazarov, Treil and Volberg 
suffices here. 

Let us fix f,g e C(R n+m ) with compact support so that 0.7||T|| < | (Tf, g) | (here 
||T|| = \\T\\ L 2^ L 2 {ll) < oo) and \\f\\m^) = ||#IU=V) = L We fix the parameter ^ so 
that spt/, spt g c [-2', 2 e ] n+rn . 

We define 

E k f = E E hh ^i ® (/> h h,m)i> 

2- k <e(h)<2 e 
E 'k9 = Yl E h h,m ® (9, h l2>m )x] 

2- k <£(I 2 )<2 e 

Ekf = Y E (fi u Ji,Ki)2®u JljK1 ; 

Kl Ji&Dm 
2- k <£(Ji)<2 e 

E'k9 = Y E (g,uj 2 , K2 ) 2 ®u j2tK2 . 

2~ k <£{J 2 )<2 e 

Here (f,h hM )i{y 2 ) = (f(-,y 2 ),h htV1 ) = ff(y 1 ,y 2 )h Il!r]1 (yi)dfi n (yx). _ 

Note that these are linear operators, / = ]xm k ^ > . 00 E k f = lim^oo E k f on L 2 (/x), 
\\E k f\\ L 2 M , \\E k f\\ L 2 M < \\f\\ L 2 M = 1 and E k E k f = E k E k f. We write 

(Tf, g) = (T(f - E k f),g) + (T(E k f),g - E' k g) 

+ (T(E k f - E k E k f), E' k g) + (T(E k E k f), E' k g - E' k E' k g) 
+ (T(E k E k f),E' k E' k g), 

or more briefly, (Tf, g) = (T(E k E k f), E' k E' k g) +e k (u),u = (uj n ,uj' n ,uj m ,uj' m ). It is 
easy to see using the above listed properties that Hindoo e k (u) = and |e fe (o;)| < 
2||T||. Using dominated convergence we deduce that 

(Tf, g) = lim E(T(E k E k f),E' k E' k g), 

k— >oo 

where the expectation E = E u . 
Note that 

E k E k f = j2 E E (/'^ 

2- k <e(h)<2 c 2- k <l(Ji)<2 e 



10 TUOMAS HYTONEN AND HENRI MARTIKAINEN 

We write 

E E = E E 

IlET> n Jl&'Dm AG©,!, good Jl&^m^ood 

2-*<^(Ji)<2* 2-*«(Ji)<2' 2-K<e(h)<2C 2~" <£(Ji)<2* 

+ E E + E E ■ 

^lSfn, good •Jl£2?m,bad ^l6X> nbad JlGCm 

2- fc <^(/i)<2 £ 2~ k <£(J 1 )<2 i 2~ k <£(h)<2 1 2~ k <£{J 1 )<2 1 

Denoting / fe = E k E k f , this gives us the decomposition 

fk /fc,good "I - fk,bad- 

Let also # fe = E' k E' k g = g k>sood + g- fcibad . 
We now write 

{Tfk,gk) = (T fk,good, 9k,good) + {Tfk,good, 9k,bad) + {Tfk,bad, 9k) ■ 

This gives us that 

\{Tf,g)\= lim I^T/fc,^)! 

< lim (\E{T f k „ ood ,g k „ ood )\ + ||T||£ , ||5f fcjbad || L 2( A1 ) + ||T||£||/ fe)b ad||z, 2 (/x)) 

k— >oo 

Estimating 

1 /2 

H/jfe.badll^O*) <( ^ Yl E Xbzd(Jl)\(f,h Ium ®Uj UK1 )\ 2 ) 
i(h)<2 e f(Ji)<2 f 

1/2 



+ (E E E Xbad(/l)| 



771 ®MJi,ki)| 2 



i(h)<2 e £(J!)<2 e 



1/2 



gives us the bound 

-S||/fc,bad||L 2 ( M ) 

<^„K^)(E E E p <( J i bad )KM/i,*®^i>l 2 ) 

£{h)<2 e ^(Ji)<2 £ 

1 /2 

+ ^,«- m ,o(E E E p^^iMKM/w®^)! 2 ) < C W' 

i(h)<2 e £(Ji)<2' ! 

where c(r) — >■ 0, when r — >■ oo. Fixing r to be sufficiently large we have shown 
that 

(3.1) 0.7||T|| < \(Tf,g)\ < lim \E(Tf k>sood , %good )| + 0.1||T||. 
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3.2. Outline of the core of the proof. The core of the proof, which will span most 
of the rest of the paper, consists of showing that 

\E\Tfk,goad, 9k,good)\ 

= \ E Y1 Yl Yl (f,hi um 0u JuK1 ) 

Vi,V2 /i, 72 good Ji,J2good 
(3-2) «i.«2 2- k <£(h),e(I 2 )<2 e 2- k <e{J 1 ),£(J 2 )<2 e 

x (g,h hiri2 ®uj 2>K2 ){T(h him ®u JuK1 ),h l2m ®mj 2jK2 ) 
< (C + O.IHTIDH/IU^IMU^) = C + 0.1||T||, 



and hence, combining with (|3.1|) , 



0.6||T|| < C + 0.1||T||, whichimplies 

||T|| < 2C, 

completing the proof of Theorem 12.131 

For the proof of (|3.2[) , we fix k (recall that £ is already fixed). We will bound 
\E{Tfk lg00 d, <7fc,good) | uniformly on these quantities. We suppress the restrictions 

2- fe <£(h),£(I 2 ),£(J l ),£(J 2 )<2 i 

from the notation. The goodness of the cubes is essential and so is the fact that we 
have the averaging operator E in front (this is used in the proof to control mul- 
tiple bad boundary regions). However, sometimes (for the collapse of paraprod- 
ucts) the fact that all the cubes are good is a problem. We deal with this during 
the proof by sometimes explicitly replacing / by / g00 d noting that (/good, hi 1>m <g> 
ujuki) = (f,h h , Vl ® uji.Ki) if h and J x are good, and (f g00 d,h Ium ® u JuKl ) = 
otherwise. 

We will focus on the part of the summation, where £(Ii) < £(I 2 ) and £(Ji) < 
£( J2). One exception of this is made during the proof to handle a certain mixed 
full paraproduct appearing in the summation £{Ii) < £(12), £{J\) > £{J2)- It has 
an essential difference to the full paraproduct appearing in our main summation, 
so it does need separate attention. 

In any case, we perform the splitting 

£=£ + £ + £• 

e(h)<i(h) e(h)<£(h) e(h)<2-n(i 2 ) 2-n{h)<t{h)<i{h) 

d(/l,/ 2 )>2^(/i)T«£(/ 2 ) 1 -T™ d(Il,/2)<2^(7i)Tn^(7 2 )l-7n d(h ,I 2 ) <M(h ) ln Zih) 1 —* 1 

These three parts are called separated, nested and adjacent respectively. The term 
nested makes sense, since the summing conditions (recalling that I\ is good) actu- 
ally imply that there is a child J 2)1 G ch(I 2 ) so that d(I u J 2 C J > 4£{Ii)' fn £(l2,i) 1 ~ nfn . 

A similar splitting in the summation £{J\) < £{J 2 ) is also performed. This splits 
the whole summation into nine parts. We explicitly deal with the following cases 
(the remaining three being symmetric to one of these): 
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(Jl,J2) 


separated adjacent 


nested 








separated 


* * 


* 


adjacent 


* 


* 


nested 




* 



The cases where the first pair I 2 ) is separated are treated in Section HI and 
the cases where it is adjacent in Section |5l The remaining cases where both pairs 
are nested are handled in Sections [6] and [3 A combination of these gives the core 
estimate (|3.2[) and completes the proof of Theorem 12.131 In the final Section [Sj. we 
investigate the necessity of the condition that Tl G BMO pro d(A0- 

3.3. Remark (Vinogradov notation and implicit constants). We will use the nota- 
tion / < g synonymously with f < Cg for some constant C. We also use / ~ g 
if / < < /. The dependence on the various parameters should be somewhat 
clear, but basically C may depend on the various constants involved in the as- 
sumptions. 

4. Separated cubes 

In this section we deal with the part of the summation, where at least the pair 
of cubes (h,h) is separated. This further splits into subcases according to the 
nature of the other pair ( Ji, J 2 ). 

4.1. Separated/separated. 

4.1. Lemma. Let h e V n , I 2 e V' n , J x e V m and J 2 e V' m be such that £{I X ) < £{I 2 ), 
t{J x ) < £{J 2 ), d{h,I 2 ) > 2l{h)iH{I 2 ) 1 -^ andd(J 1 ,J 3 )> 2£{J 1 )^£{J 2 ) 1 -^. Let 
fiu fju 9h an ^ 9J2 be functions supported on I lt J lt I 2 and J 2 respectively, and assume 
that H/jJUa^) = \\fjih^ m ) = hhWtfi^) = lbj 2 IU 2 (Mm) = 1 and f fcdfin = 
f fj 1 d/j, m = 0. Then there holds that 

mfi^fj^gi^gj^^AZAZ^ 

where 

,se P _ £(hr/H(I 2 )^ / /2 

and 

Aj ^~ D{J^J 2 )Hn Vw&Jl \ m {w,D{J^J 2 )f m{Jl) » m{J2) ■ 
Here D(h, I 2 ) = £{h) + £{I 2 ) + d(h, J 2 ). 

Proof. We write (T(f Il (g) fjj, g l2 ® g j2 ) as the integral 





K(x J y)f h (y 1 )f Jl (y 2 )g l2 (x 1 )gj 2 (x 2 ) d/j, m (x 2 ) d/x n (xi) dn m (y 2 ) dfi^yx) 

'h J J! JI 2 JJ 2 

and then replace K(x, y) by 

K(x, y) - K(x, (yi, w)) - K(x, (z, y 2 )) + K(x, (z, w)), 
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where z £ I\ and w G J\ are arbitrary. The replacement can be done inside the 
integral, since / f h d/j, n = J f 3l d/j, m = 0. 
Next, one notes that 

\yi -z\< £(h) < ZihVHih) 1 -^ < d(h,I 2 )/2 < \x x - z\/2, 

and that similarly \y 2 — w \ < \x 2 — w\/2. Using the Holder condition of K this 
yields 

\K(x, y) - K(x, (y u w)) - K(x, (z, y 2 )) + K(x, (z, w))\ 
< 





yi - A 


a 


y 2 - w\ 


P 


\xi — z 


a K(z, 


Zl - z\) 


x 2 - w\P\ m (w, 


\x 2 - w\) 



< £(h) a l{Jif 

~ d(I h I 2 )<*\ n (z, d(h, I 2 )) d(J 1: J 2 y\ m {w, d(J u J 2 )) ' 

Splitting into cases d(Ii, I 2 ) > £(I 2 ) and d(Ix, I 2 ) < £(I 2 ) one can show precisely 
like in the proof of [TT1 Lemma 6.2] that 

£(h) a < £(h) a l 2 £(I 2 ) a l 2 



d{hj 2 y\ n {z,d{hj 2 )) ~ D{i x j 2 )<*\ n {z,D{h,i 2 )y 

The result of the lemma readily follows. □ 

We recall ffTTl Proposition 6.3]: 
4.2. Proposition. There holds 

E ^v„< (5>?,) 1/2 (£*4) V2 

e(h)<e(h) h h 

for xi 1 , yj 2 > 0. In particular, there holds that 

2x1/2 /„ n n 1/2 



(E[ E ^D's(E4 



J 2 /i:^(/l)<^(/ 2 ) *1 

We are to bound 

XI \(f,h h ,vi® u Ju*i)\ 
VUV2 t(h)<t{I 2 ) e(Ji)<t(Ja) 

K 1: K 2 d(/ 1 ,/ 2 )>2f(7i)^£(/ 2 ) 1 " 7 " d(Jl,J 2 )>2£(Jl)7m£(J 2 )l-7m 

x \{g,hi 2 , V2 ®uj 2tK2 )\\(T(h hm ®u JuKl ),h l2tV2 ®uj 2tK2 )\. 
In general, we need to split 

E = E + E + E + E- 

vi,V2 m=o vi=o vi¥"0 m^o 

Kl,K 2 Kl=0 Kl^O Kl=0 KlT^O 

»? 2 ,K 2 7? 2 ,K 2 V2,K2 T)2,K2 

However, in this separated /separated sum only the fourth summation appears. 
Indeed, say 771 = 0. Then 2 £ = £{h) < £(I 2 ) < 2 e i.e. £(h) = £(I 2 ) = 2 l . If the 
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cubes 1 1 and I 2 actually contribute to the sum, one must have I\ fi [— 2 e , 2 e ] n ^ 
and l 2 n [-2 e , 2 e ] n ^ 0. But this means that d(h, h) < 2 m = 2£(I 1 )^£{I 2 ) 1 ^. 

The main term, where rji 7^ and K\ 7^ 0, is the only one that remains and can 
now be handled. In that sum Lemma l4~T1 gives 

\( T ( h h,vi ® u JuKl ), h him ® uj 2tK2 ) \ < A s ^ h A s JVj 2 . 
With fixed 771 7^ 0, ki 7^ 0, 772 and k 2 we get from Proposition 14.21 that 

E E A lTh A Tj2 I (/' h h,Vl ® «Jl,*l)l I ^2,7,2 ® «J2,« 2 }| 

£(/i)<£(7 2 )f(Ji)<^(J 2 ) 

s E ^ a (Ei</»^®«Ji,« 1 >i a ) 1/a (Ei^^ 



1/2/^,. . xl/2 
,? 2 ® ^W' 



£(/i)<<?(/ 2 ) J2 



< 



l/2 / ^^ ij _ _ xl/2 



>l 2 ) '(EEifo-** 



7?2 ™ U J2,K2 

h Jl h J2 

^ ll/IU 2 (/i) IMU 2 (ao- 

4.2. Separated/nested. We begin by remarking that in this sum we automatically 
have ?7i 7^ and K4 7^ 0. We write (T(hi im ® u JliK1 ), hj 2:V2 Cg> uj 2>K2 ) as the sum 

[{T(h htVl <g> /i/^ ® n j2jK2 ) - (uj 2iK2 )j x {T{h Iliril <g> u Jl}K1 ), h l2:V2 ® 1)] 

A2) Ji (T(hi uVl ®Uj uKl ),hi 2tV2 (g) 1). 

We start by dealing with the sum having the first term as the matrix element. Let 

J 2 ,i G ch(J 2 ) be such that Ji C J 2 ,i. Note that (uj 2)K2 ) Jl = {uj 2>K2 )j 21 . Therefore, 
we have that 

(T(hj ),hi 2 , V2 ®uj 2)K2 ) - (uj 2iK2 ) Jl (T(h h 

,771 <> Uj ltKl ),hi 2 , V2 ® 1) 

equals 

(T(hj 

,V2 <> 

,T7l U Jl,K! ,772 u J2,k 2 XJ 2 ) ■ 

J 2 ech(J 2 ) 

J 2 CJ 2\J2,1 

As Ji is good, Ji C J 2 ,i and Ji) < 2 _T 7( J 2 ,i), we have 

rf(Ji, J 2 ,i) > 4£(Ji)^£(J 2 ,i) 1 ^ > 2^(J 1 )^£(J 2 ) 1 -^. 

4.3. Lemma. T/zere JzoZds 

,^2/^2,1 1,771 5v Uj ljKl 

,m ® XJ"f i)l ~ A-hh^-JxJzi 

where 

1/2 



ViJ 2 



\£{J 2 )J W(^2,i)/ 
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Proof. We write (T(h h)Vl ® u JllK1 ), h hm ® xjcJ as the integral 

K(x,y)h IuVl (y 1 )uj uKl (y 2 )h l2:V2 (x 1 ) d/j, m (x 2 ) dfi n (xx) dfi m (y 2 ) d/j, n (yi), 




and then, using the fact that rj\ ^ and K\ ^ 0, replace K(x, y) with 

y) - K(x, (yi, w)) - K(x, (z, y 2 )) + K(x, (z, w)). 

Here z e h and w G J\ are arbitrary. Since d(h, I 2 ) > 2^(7i) 7 ™£(7 2 ) 1 ~ 7 ™ and 
d(Ji, J| (1 ) > 2£(Ji)^£(J 2 ) 1 - 7m , we have - z\ < \x x - z\/2 and |j/ 2 - io| < 
\x 2 —w\/2. Now, we have 



\x 2 - w 

Jn m \B(w,d(Ji,J£ A )) 

where 



x / t — 7 i dfi m (x 2 ), 

Jn m \B(w,d(Ji,J$ 1 )) A m\W, \X 2 — W\) 



Noting that | («j 2 , K2 )j 21 | < /im^i)"^ 2 finishes the proof. □ 

4.4. Lemma. 7/ J2 e ch(J 2 ) and J' 2 d J 2 \ J 2j i, then 

\(T(h htVl ® wj^kJ, ^i 2 ,r, 2 ® mj 2;K2 x^)| < A^j 2 Aj lJa . 
Proof. Since d(J u J' 2 ) > d{J u J| ;1 ) > 2£( Ji) 7m £( J^) 1 ^ 7 ™, Lemma ED gives that 

with any w E J\ C J 2 ,x- It is easy to see that 

(J"m(J 2 ) 1/2 < / , 

X m (w,£(J2)) 

which ends the proof of the lemma. □ 
We recall EH Lemma 7.4]: 

4.5. Lemma. There holds 



1/2 / ^ X 1/2 

2 



Jl,>/2 Jl ^2 

JicJ 2 ,iech(j 2 ) 

<(Ji)<2-^(J 2 ) 
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for x Jl , y j2 > 0. In particular, there holds that 



x 



1/2 



El E <***J) s(E 

J2 Ji J\ 

JiCJ 2 ,iGch(J 2 ) 
^(Ji)<2-^(J 2 ) 

Combining this lemma with Proposition 14.21 and the above estimates gives that 
the summation 

E E 

is dominated by ||/||z, 2 (/Li)llfl , IU 2 (M)' if we are using the modified matrix element 

Ji {^(^hm ® u Juki) j hi2,m ® 

What is left to do is to consider the same summation but with the matrix ele- 
ment 

(UJ2 

(T(h IuVl <S> u JltK1 ), h htV2 <g> 1). 

Recall that all the appearing cubes are good. In particular, J 2 is good and this is 
actually slightly problematic. However, for good I 2 and J 2 we can write (g, hi 2m ® 
«j 2iK2 ) = ((? g ood, hi 2>m ®mj 2 , K2 ), and after this we may add all the bad J 2 to the sum- 
mation, since (g g00 d, hj 2:V2 <g> u j2:K2 ) = if J 2 is bad. Moreover, in the non-trivial 
case (uj 2:K2 ) Jl ^ 0, we have J\ fi J 2 ^ so that we may remove the summing 
condition d{J 1 ,J 2 ) < 2£{J l )^£{J 2 ) 1 -^. 

In what follows we explicitly write the fact that I\ , I 2 and J\ are good, since 
we need to highlight the fact that J 2 is not. For the J 2 summation we also write 
out the hidden summing condition £{J 2 ) < 2 l . For fixed 771, K\ and r] 2 we need to 
bound 

Y Y ( Y Y^° od ' hi ^ ® w ^,«2> u J2,^) Ji 

iij-fo good Ji good J 2 K2 

t(h)<i(h) 2^t{J 1 )<£{J 2 )<2 1 
d(/i,7 2 )>2£(/i)T™^(/ 2 ) 1 -T« 

x (f,hi um ®u JuK1 ){T(h Iim ®u JliK1 ),h hjTI2 <g> 1). 

We write (# g00 d, /i/ 2l??2 ®wj 2 , K2 ) = (fl^, «j 2 , K2 ), where 0^(2/) = (# g00 d, ^2)1(2/) = 
/ R » fi'goodO, Z/)^/ 2 ,r, 2 0) dfi n (x). Then we note that 

v-, / 2l ,2 v / \ _ J *J*b&£Kv), if i(J 2 ) < 2*, 

A,ty^' tt -W tt WW I A^(^)(y) + ^ 2 (^)(y), if i(J 2 ) = 2f 

Therefore, the sum inside the average over J\ equals the constant (g^ol) S(Ji)f 
where 5(Ji) e V' m is the unique cube for which £(S(Ji)) = 2 r £(J!) and J x c S{Ji). 
Note that for this collapse it is important that J 2 is not restricted to good cubes. 
The cube S(J{) exists since J\ is good. 
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We are left to bound 



E E E E (43>Ji(%*»*Ji«).^«i) 

,^i,V2 7i,i"2good J2 Ji good 

l(h)<e(I 2 ) J1CJ2 
d(/i,/ 2 )>2^(/i)^f(/ 2 ) 1 ^ 7 ' 1 <(Ji)=2-«-f(J 2 ) 



x (f,h Ium ®u JuK1 ). 



We note that this equals 

E E 



i?i,Ki,r?2 i"i,/ 2 good 

«(/x)<i(/2) 

d(7i,72)>2£(/i)T«f(/2) 1 -T" 



® (n,^x«7a) /jj ; S'good 

J 1 J 2 



where /£ = (/, / i/l , 7?1 ) 1/ ^ 2 2 = (T*(/i Ia , TO ® 1), ^ 1)7?1 }i, and 

n a 1U,= E E ( W )j2( a ^ U Jl,Ki)uJi,K 1 , «1^0. 

J1CJ2 

e(.h)=2-re(j 2 ) 

Here, of course, the pairings and averages are taken with respect to the /i m mea- 
sure. 

There holds 

^/2,»72 ® (n^i^) /r 1 1 )fl'good 

^1 ^2 

11^1,12 7l,/2gOOd 

<(ii)<«(/a) 

d(/l,/2)>2^(/l)T"€(/2) 1 -T" 

< E E ^®(n^*r/^ 

^1 ^2 

li^i* 7i,/2good 

d(/l,/ 2 )>2<?(/l)T«^(/2) 1 -T™ 

2 \ 1/2 

IMU 2 ^) 



E 



L 2 ( M ) 



L 2 ( M ) 



I^IU 2 ^) 



E (E 

I?1,K1,?72 ^2 good 



E 

i"i good 

^(/l)<^(/2) 
<i(/l,/2)>2^(/l)T'^(/2) 1 - 7 ' 1 



J X J 2 



< 



E (E 

J?X,«X,»?2 ^2 



2 \ 1/2 



E lira II/? 

h:t(h)<e{I 2 ) 
d(/l,/ 2 )>2£(/x)T«^(/2) 1 -T» 

where we used the orthonormality of the functions hi 2iV2 e L 2 (/i n ), and the fol- 
lowing paraproduct boundedness result: 

4.6. Lemma. For any M > 1, we have 

l|na||L 2 ( Atm )^L 2 ( Atm ) % IM^MO 2 ^™)- 

The BMO norms above are estimated as follows: 
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4.7. Lemma. Ifrjx ^ 0, £(h) < £(I 2 ) and d(I x , I 2 ) > 2l{I l )^l{I 2 ) 1 -^, then thereholds 
that 

Wl\T 2 llBMO§(/x m ) ~ A hh 

forblT 2 ={T*{h h , m ®l),h h , m )i. 

Proof. We fix a cube V C M m and a function a such that spt a cV and J a dpi m = 0. 
We need to show that 

\( T ( h h,vi ® a ), h h,m ®Xzv)\ 

+ \(T(hi um ®a),h l2tV2 ®X(3V)«=>I ^ ^S 2 ll IU 2 (M m )^( 3T/ ) 1/2 - 
There holds with an arbitrary zGlj that 
\{T(h Iim ®a),h l2!m ®X3v)\ 

= / / [^a, X s V (zi,yi)-^a,X8v( x l>^)]^,tn(yi)^Ia.»»( z l) d A*n(^l)^A*n(!/l 

and this implies that 

(9 a), h hm ® X w)\ < A s %C(a, X w) < A% a \\a\\v 0lm) ii m (3V) 1 ' 2 . 
An easy consequence of the Holder estimates of the kernel K is that 

\(T(h Ium ® a), h h>V2 ® X(3y) c )| 

<^p / / , , — fr|a(!/2)| dfi m {x 2 ) d^ m {y 2 ) 

JV J(3V) C \ x 2 — CvrA m {C V , \X 2 — C V \) 

J(3V) C ^m(,Cy, \X 2 — Cy\) 

<A7S 2 \\ah^)»rn(V) 1/2 . 

This completes the proof of the lemma. 



Thus 



E[ 



E 

/ i: ^/i)<^/ 2 ) 
d(i 1 ,h)>2e{i 1 ytn£(i 2 ) 1 -~<™ 

< 



\KTJ 



2\ 1/2 



■r?i || 
i \\L 2 (p m ) 



< 



(E[ E 

/2 Ii:£(Ii)<£(l2) 
_ A s 1/2 

H/ii Hi 2 (Mm) J < ll/IU 2 (M)' 

II 



2 \ V 2 



□ 



1 

where we used Proposition [42] in the second- to-last step, and the orthonormality 
of the functions h]*, implicit in the notation = (/, in the last step. 
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4.3. Separated/adjacent. Unlike in the full diagonal, no surgery is needed in this 
part of the summation. Simply by using the kernel K UJ ,uj K2 an d the knowl- 
edge that C(u JljK1 ,Uj 2iK2 ) < \\u JuK1 \\ L 2 ifMm) \\uj 2tK2 \\ L 2 M = 1, we see like before 
that separation in W n gives 

\(T(h h 

For a given J x there are only boundedly many J 2 for which 2~ r £( J 2 ) < £( Ji) < 
£(J 2 ) and rf(Ji, J 2 ) < 2£(Ji) 7m £(J 2 ) 1 " 7m . Combining this fact with Holder's in- 
equality and Proposition 14.21 readily yields that the corresponding summation is 
dominated by WfWm^WgW^^)- 

5. Adjacent cubes 

We proceed to the case where the pair (ii, J 2 ) consists of adjacent cubes. Since 
all separated cases were already handled, this leaves us with the two possibilities 
for the other pair ( J 1: J 2 ) of being either nested or adjacent. 

5.1. Surgery for adjacent cubes. For the analysis of the adjacent pairs (h,h) 
handled in this section, we will need to carry out a further splitting introduced 
by Nazarov, Treil and Volberg under the name surgery. We introduce it here in a 
pleasently compact form. 

Consider two adjacent cubes Ii, I 2 with < £(I 2 ). Let 9 e (0, 1). We perform 
surgery on (I u J 2 ) with parameter 9 > 0. Let j(9) e Z be such that 2~ 21 9 < 2 j{ ^ < 
2~ 20 9. Let V* be yet another random grid in W 1 , independent of all other grids 
considered. Let G := {g e T>* : i(g) = 2 j( - e k(h)}, and for x G M n , let G{x) be the 
unique cube in G that contains x. We define 

Ji,a := {x e h : d(G(x), dl 2 ) < 9£(I 2 )/2} U {x e h n J a : <9G(x)) < 0^(G(z))}. 

Thus points in I^g belong to I\, and are either close to the boundary of J 2 , or to 
the boundary of the grid G. The set I^q depends on the set J 2 as well. However, 
we have 



lip C Ii,bad h n 

(5.1) 



|J {x:d(x,^)<^)} 

^(/i)<^(^)<2^(/i) 

U |J {x:d(x,9y)<^(y)} 

/(ff)=2« fl >*(/i) 



which depends only on Ii and the grids and V* n . 
We set 

A,sep :=ii\(/i, 9 U/ 2 ), 
the part of I\ strictly separated from I 2 . Finally, we have 

h,A ■= h\(Ii, 9 Ul ltSep ) = {JL U 
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where each L { is of the form L f = (1 — 6)g D I\ D 1% for some g EG, and #z < e 1. 
In fact, Lj is of the form L { = (1 — 0)g unless it is close to the boundary of I\, it 
cannot be close to the boundary of I 2 , since such cubes were already subtracted 
in the I^g component. 
We have the partition 



A key observation is that all c I\ D h appearing in the first union are cubes 
(of the form (1 — 6)g for g E G) unless they are close to dl\, and they are never 
close to dl2, while the Lj in the second union are cubes unless they are close to 
dl 2 , and they are never close to dl\. Thus, all Li = Lj that appear in both unions 
are cubes and then 5 Li c h D h- 

5.2. Adjacent/nested. The structure of the decomposition is the same as in the 
separated /nested case. Namely, we write (T(h IliV1 (g> u JliK1 ), h l2tV2 (g> uj 2 ,k 2 ) as the 
sum 

[(T(h htVl (g) u JltKl ), h h)Tt2 <S> uj 2)K2 ) - (uj 2 , K2 ) Jl (T(h h ),h h , m g) 1)] 



Again, we start by dealing with the sum having the first term as the matrix ele- 
ment. Let J 2 ,i G ch(J 2 ) be such that J\ c J^\- Note that (uj 2)/t2 )ji = (uji,^ .h.y 
Therefore, we have that 

{T(h Il>m ® u JljKl ), /ij 2))B <g> u j2jK2 ) - (n j2A2 ) Jl (T(/i /liT?1 ® u Jl)K1 ), /i/ 2ir?2 ® 1) 

equals 

(T(h Ium g> u Jl>M ), /i/ 2fl2 <g> XjiJ 




and in a completely analogous manner also 




+ (^J 2 




5.2. Lemma. T/zere fooZds 



in 
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5.3. Lemma. If J 2 e ch(J 2 ) and J 2 C J 2 \ J%\, then 
\{T(h hjVl ®u JljK1 ),h l2 

Indeed, the proofs are similar as before except one uses the kernel K hl ^ >h n 
and the fact that C(hi ltV1 , hi 2iV2 ) < 1. Recalling Lemma |4~5l and the fact that for 
a given I\ there are only boundedly many I 2 for which 2~ r £(I 2 ) < £(h) < ^(^2) 
and d(Ji, 1 2 ) < 2£(J 1 ) 7 "£(J 2 ) 1 ~ 7 ", it is immediate that the corresponding series is 
dominated by H/H^IMU^)- 

Therefore, we need only to consider the same summation but with the matrix 
element 

(uj 2 

Exactly as we have seen before, the series with this matrix element collapses to 

^htVa ® (n^ija) fj^ , g good 

^1 ^2 

m,Ki,r]2 h, h good 

2-^(/ 2 )<£(/ 1 )<f(/ 2 ) 
rf(-fi,-f2)<2£(/i)^^(/ 2 ) 1 - 7 " 



E E (( n 6';i; ? 2)*/7 1 1 7fl , good,/ 2 )2 ; 



Vi, K i,V2 h,h good 



where = (f,h Ium ) lf g£ oodj2 = (g g00 d, h l2m )i, and ^ 2 2 = (T*(h h>m 01), 
The summing condition tying fx and J 2 m the first line has been abbreviated to 
Ii ~ I 2 in the second. 
We can then estimate 



^ ((nS^a) //j 1 , fi'good,/ 2 )2 
ii ,J 2 good 

i/ 2 1 1 BMO| 5 (/i m ) II//? II 



/l,/2 

where we also used the simple bound that Ibgood^lU 2 ^™) ^ I^IU 2 ^™)/ which 
follows at once from the Haar expansions. 

5.4. Lemma. There holds that 

raiHMD?.^) < C(9) + C\\T\\(Hl bad + Hl had ) 

where 

/ \ 1/2 

^ech(/<) 

and i^bad is defined as in (|5.1|) (£/iaf z's, we wse surgery with -parameter 6 > 0). 
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Proof. We fix a cube V C M m and a function a such that spt a C V and J a d/i m = 0. 
We should show that 

\( T ( h h,vi ® «)> ^ ® 1)1 < [CW + C||T||(^ bad + flJJUIall^j^ClSV) 1 / 2 . 
We split 1 = X3V + X(3\o=- The Holder property of K hli ^ hl2m gives that 

\(T(h Ium ®a),h htV2 ®X(3vr)\ < IMU^^mOO 172 - 

For the remaining part with xzv in place of I, writing the Haar functions hi uVi 
as linear combinations of Xi'r where I[ e ch(/j), we are reduced to bounding 

\( h hm)rM h h > m)iL\\( T (xi' 1 ®a),Xi! 2 ®X3v)\. 

hech(h) i 2 €ch(i 2 ) 
We concentrate on one such pairing 

(T(Xh ® a),xi 2 ® Xsv), 

where we have dropped the primes from I[ for brevity. This will be handled by 
applying the surgery to the pair (Ji, J 2 ). 
We write 

aG{sep, 9} 
/3e{sep,0} 

+ 2<r(xL i ®o),X£ i ®Xsv) 

+ 5^<r(xL i ®a),XL i ®X3V>. 

If a = sep or j3 = sep or % 7^ j, then the corresponding pairing is seen to be 
dominated by 

C(^) / i n (J 1 ) 1 / 2 / Un(/ 2 ) 1/2 ||a|| i2(/ , m)/ i m (3V) 1 / 2 

using the size estimate of the kernel K aiX3V together with the fact that the sets are 
separated by c(9)i(Ii) ~ c(9)£(I 2 ). In the case i / j a further large dependence 
on 9 is gained from the summation 1- 

If i = j, then by the diagonal BMO assumptions the corresponding pairing is 
dominated by 

C > u„(5 J L. t )||a|U 2(/ , m) /x m (15V) 1/2 < C f Mjl (/ 1 ) 1/2 y ar l (/2) 1/2 ||a|U 2(/x „ i)> u m (15^) 1/2 . 

A factor C{6) is gained from the summation ^ i=J 1. 

Finally, the sum of the cases a = d and = d is dominated by 

The BMO estimate now readily follows. 

□ 
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Recalling that 



E HAT II ■m ) 

heT>„ 



< 



£ 2 (m)> 



E 1 



\9i 2 llL 2 ( Mm ) S \\9\\l 2 {^ 



we now know that 



hh 1 1 BMOf 5 (jm„ ) 1 1 fll I \l 2 (jim) 1 1 fl'/a 1 1 L 2 



£ EH" 



(Mm) 



/ 2 €X>; /iGX> n 
Jl~/2 



<c(e)||/|u 2M ibiu V) + c7||r||[ E (K*J 2 K\\l 2 



2 (Mm) 



1/2 



MU 2 (m) 



+cimi ii/ii^) [ E ( F w) 2 ii4 



2||^2||2 

2 \\L 2 (fl m ) 



hev> 



1/2 



The last two terms are symmetric to each other. For example, the first one can 
be estimated by 

E^E^KH^^) 2 ] = E \( h h,Vl)l' 1 \ 2E < E ^[f J 'n{l'l ) ba.d)] 

^ech(Ji) 

<c{9) E \(h h , m )i { \ 2 M)<c{9), 
^ech(ii) 

where c(6*) — >■ 0, when 9 — > 0. Furthermore, we have 



2 (Mm) 



< 



^<[EW,bad) 2 H/Tlli 2 

" ex>„ 

[ e ^M(K*j 3 ]\\m\h 



(Mm) 



hev n 



1/2 



<<#)[ E ii/ 



m ||2 

ii llL 2 (/z m ) 



1/2 



<c(WlU> M . 



We have proved that the expectation of the series with the matrix element 

1,771 U Jl,Kx ,m ± / 

is dominated by C{9) \\ f\\ L 2^ || g \ \ L 2 {im) + c{9) \ \ T\\ \ \ f \ | ^ || g\\ L 2 (m) . Fixing 9 small we 
establish the bound C'||/|| L2(At) ||^|| i2(At) + 2- 100 ||T||||/|| i2(/ , ) ||^|| i2{At) . 
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5.3. Adjacent/adjacent. We need to estimate the pairing 

= Y ( h h,m) i[( u Ji ,Kl ) J[ ,K 2 ) J' 2 

T' T 1 J 1 V 

where the summation is over all dyadic children /{ of I\, I' 2 of J 2 etc. 

5.5. Lemma. For adjacent pairs (ii, I 2 ) and (J x , J 2 ), we have 

\( T ( h h,vi ® u Ji,Ki)i h i2,m ® U J2,^)\ 

< C(e) + C\\T\\ (Hl bad + Hl had + H? lfmd + H^ had ), 

where e > is the surgery parameter both on W 1 and R m . 

Proof. In the surgery of ( J 1; J 2 ) we write Ki for the sets that correspond to the sets 
Li in the surgery of (Ji, J 2 ). We now decompose 

(^(X/i ® XjJ, X/2 ® Xj 2 ) = Y ( T (Xh, ai ® XjJ, X/2 ® Xj 2 ) 

aiS{sep, 9} 
/3ie{sep,a} 

+ J] ( T (^i,a ® XJi, Q2 ),X/ 2 ,a ® Xj 2 ) 

02G{sep, 9} 
/3 2 G{sep,9} 

+ X ( T (^i ® XJi,a)> XL n ® XJ 2 ,a) 

+ X Y^ T ^ XL n ®XK t2 ),XL H ®XK ]2 ) 

n=ji «2=j2 

M H = ji and ^2 = J2/ then the weak boundedness property gives that the 
corresponding pairing is dominated by 

Cfi n (5L n )fi m (5K l2 ) < CfiMi) 1/2 Vn(h) 1/2 fiMi) 1/2 vM) 1/2 - 

A factor C(e) is picked up from the summation ^2 il=jl J2 i2= j 2 1- 
The sum of the cases a\ — d and 0x — d is dominated by 

imiw/x.bad) 1 /^ 

The sum of the cases a 2 = d and /3 2 = d is dominated by 

||T||/i n (/ 1 ) 1 /Vn(/ 2 ) 1/2 (^ m ( Jl,bad) V2 /i m (^) 1/2 + Mm( JlJ^mC J 2 ,bad) 1/2 ). 
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In all the other cases we have separation in M n by e£(Ii) ~ e£(/ 2 ) or separation 
in W m by e£(Ji) ~ e£( J 2 ). Partial kernel representations give the bound 

L7(6) / i n (J 1 ) 1/2 /in(/ 2 ) 1/2 /im(Jl) 1/2 /i m (J2) 1/2 . 

We have shown that 

\(T(xii®XJx),Xi2®XJ*)\ 
<C(e)fr(I 1 ) 1 ftfr(I 3 ) 1 ^(Ji) 1 ' 2 t i m {J 2 y' 2 

+ n|(^(I llba d) 1/2 /in(/ 2 ) 1/2 + ^(/l) 1/2 /i„.(/ 2 ,bad) 1/2 )M m (Jl) 1/2 /i m (J 2 ) 1/2 
+ ||T|| /Un (/ 1 ) 1 / 2 /U „(J 2 ) 1 / 2 (/i m ( J 1 , bad ) 1/2 /i m ( J 2 ) 1/2 + /l m ( Jl) 1/2 /i m ( J 2 ,bad) 1/2 )- 

Apply this to | (T( X i> ®Xj{), Xi> 2 ® Xj 2 > 1/ where J( G ch(Ji), V 2 G ch(/ 2 ), J( G ch(Ji) 
and J' 2 G ch( J 2 ) to get the claim. □ 

With this lemma at hand, we see in the same way as in the adjacent/ nested 
case that 

),hi 2 

is dominated by C(e)||/|| L 2 (/l) ||^|| L 2 (/i) + c(e)||T|| H/Hl^Hs-IU^), where c(e) ->■ 
when e — >■ 0. Again, fixing e small we establish the bound C'||/||i2( /i )||y||L 2 M + 
2- 100 ||T||||/|| L2(M )|klU 2 (,). 

6. Nested cubes 

The only case left to deal with consists of quadruples of cubes such that Ji C 
I 2 ,\ G ch(J 2 ) and J x C J 2 ,i G ch( J 2 ). We then split 

(T(h Ium ® u JliK1 ), 1 ® SjJ j2 ) 
(T(h hm ®u JuKl ),sf ih ® 1) 

+ (hh,m)h( u J2,K2)ji 

(T(h h>Vl ®u JltKl ),l), 

where 

s /i/ 2 = Xii 1 (h hm - (h l2iTl2 ) i 2>1 ) = -(hi^h.iXi^ + X hi2,mXi 2 

/ 2 C/ 2 \/ 2 ,i 

and 

S J?J 2 = Xj^iuj^ - (u j2>K2 )j 2!l ) = -{uj a>Ka )j a ,iXJ§ tl + X wj 2 ,« 2 X4- 

J 2 Gch(J 2 ) 
J 2 CJ 2 \J 2 ,i 
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6.1. Terms involving separation. We deal with the first three terms in the split- 
ting, where at least one of s^ 7 or Sj 2 iJ2 appears. 

We first consider the sum having the matrix element (T(hi liVl ® u j 1)K1 ), sf j ® 
s K2 ) 



6.1. Lemma. There holds that 

\{T(h h , m ® u JuKl ), sf il2 s ^ J2 )| < Ag^j, . 

Proo/. We write out (T(hi ljVl (g) uj 1)/tl ), s^ /2 ® Sj*j- 2 ) as the following sum of four 
terms: 

{hn, V2 )i 2 ,A u J^2)j2,i{ T {hi 1 ,r 11 <8> uj^J^/s,! ® Xj^) 
J 2 ech(J 2 ) 

4 CJ 2\J2,1 

,«2/ J2.1 

^ech(J 2 ) 

I 2 Ch\h,l 

,rji ® u Ji,ki ),hi 2 ,V2Xi 2 ® u J2,k2Xj 2 ) ■ 

/ 2 ech(7 2 ) J 2 ech(J 2 ) 
I 2 Ch\h,l J 2 cJ 2 \J 2 ,i 

We handle the second term here (the rest are bounded using the same tools). 
We record that 

> 4£(/ 1 )^(J 2il ) 1 -^ > 2£(/ 1 )^£(J 2 ) 1 ^" 

and 

d( Ji, 4) > ^1, ^2,1) > 4£( J0^£( Js,!) 1 "^ 

= 4£(J 1 ) 7m ^(J^) 1 - 7 '" > 2£(J 1 y™£(J 2 ) 1 ~" /m . 

Recall that 771 7^ and fti 7^ (that is, the corresponding Haar functions have zero 
means), since for example I\ is (much) smaller than I 2 and so i(h) < 2 e . Thus, 
with any z E I\ and w G J\ we may write 

(T(h Ium <g> « JllKl ), X/ 2 c a ® mj 2iK2 Xj 2 ) 

y) - (yi, iu)) - K(x, (z, y 2 )) + K(x, (z, w))} 

x h Iim (y l )u Jl>K1 {y 2 )uj 2tti2 (x2) dfi m (x 2 ) d^ n (x x ) dfi m (y 2 ) d^yx). 




h JJi J J|, JJ' 2 
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We get the bound 

\(T(h hm ® u JltK1 ) } Xii A ® uj 2 ^ 2 xj> 2 )\ 

\x\ — z\~ 



A Mm^lJ PmW2 jl t t/\R\ I Jli jf\\ 

d( Ji, J^)P\ m (w, d(Ji, J' 2 )) 

As in the proof of Lemma |4~T1 we get that 



d{JuJ'2Y\n{w,d{J u J' 2 )) ~ D(Ji, J' 2 y\ m {w,D{J x ,J> 2 )) 

l{J x fl 2 



< 



i(j^/ 2 x m (w,£(j^)y 

As we have also previously seen, there holds that 

|.Ti — z\ 



/ 



l n \B{z,d(h,I% A )) ^n(z, pi — Z|) 

and 



d^(a;i) < dill,!*,)'* < l{h)- a '*l{I 2 



It remains to recall that | (hj 2:m )j 21 1 < jU n (/2,i) -1 ^ 2 - Indeed, putting these estimates 
together readily yields the bound Af ll2 A 1 ^ h . □ 

Combining the previous lemma with Lemma 14.51 immediately gives that the 
summation with the matrix element (T(hi 1>m <8> u Jl;K1 ), sf ll2 ® Sj 2 j 2 ) is dominated 

by ||/IU2 ((U) ||^|| L 2 M . 

We now deal with the sum having the matrix element (uj 2tK2 )j 1 (T(h Il!Vl (g) 
u Ji,ki), sf i ® 1). Just like in the separated /nested case, we find out that the 
corresponding sum collapses to 

( E E ^ ® (n^V^good), 

»?i,Ki,»?2 ii,72good 

t{h)<2-n{i 2 ) 
hch 

where = (f, h htVl ) l and bfj 2 = (T*(sf ih <g> 1), hj^t. Thus, it is enough to fix 
rji ^ 0, Ki ^ and 772, and then show that 



E h hm ® (nju)* # 

^2 

7i,i"2 good 
<(/i)<2-^(/ 2 ) 
/1C/2 



ra , , £ II/IIl 2 (m)- 

L 2 (/i) 
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An estimate of by now familiar nature shows that ||&/^||BMO|(^ m ) ~ ^fih- ^ e 
have by orthonormality and Lemma |475l that 



h,h good 

e(h)<2—e(i 2 ) 
hch 



h'2 



< 



< 



E E A tjfr 

h good h good 

e(h)<2- r e(i 2 ) 
hch 

2^\\fh\\h^ m )) < II/IU 2 (m)- 
h 



\L 2 (fj, m ) 



2 \ 1/2 



Note that the proof of the boundedness of the summation with the matrix ele- 
ment {hi 2m )h{ T { h hm ® 1 ® s jIj 2 } is analogous. 



6.2. Full paraproducts. We are reduced to considering the summation with the 

matrix element {hi 2m )h{ u J^)h{ T { h h,vi ® u h^i)^ l )- We fix and K i ^ °- 

We need to simplify the summation 



E E E E E ( h h,m)h( u J2,K 2 )ji 

^^m,good 

hCD' n hcV' m 

2 r e{h)<e(i 2 )<2 i 2n(j l )<i(j 2 )<2 t 
x (Qgood, h htm <g>uj 2tK2 )(T*l,h Il>m <S>u Jl>K1 )(f,h Ium ®u JltKl ). 



Notice that 



E E^good, 

J 2 ev' m k 2 
2n(j 1 )<i(j 2 )<2 1 



hi 2 , m ® Uj 2tK2 )uj 2 



h 



((9, 



good, hi 2tV2 )i)s{J t )i 



where 5(Ji) e is the unique cube for which i{S(Ji)) = 2 r £(J 1 ) and Ji C £(Ji). 
Continue to notice that 



E E ^ /2 > r ? 2 ® ^s ood ' hi 2 ,m)^j 
hcv n m 

2 r e(h)<i(h)<2 e 



hxS(h) 



(fl'good)s(/i)xS(Ji), 
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where S(h) e V n is the unique cube for which £(S(h)) = 2 r £(h) and h c S'(Ji). 
Therefore, the expression we started with collapses to 

E E (0good)s(ii)x5(Ji)<r*l, /i Jlj)n <8> u JltK1 )(f, h Iuni ® n JlAl ) 

n,good m,good 

= £ ^ E E (»good>J a xJ a <T*l,/lJi,i7i®"Ji,Ki) 

/lC/ 2 J1CJ2 
<(/l)=2-^(/ 2 ) f(Ji)=2-^(J 2 ) 

x (f,hi um ®u JllKl ). 

One can write this as a pairing (/, LT^ 1 g g0 od), where 

n^ lKl n= E E E ( u )i2xJ2(b,h hm ®uj 1>K1 )h Iim ®uj 1>K1 . 

hED'n h&> n ,good J2&T>' m Ji€T> rngood 
hCh J1CJ2 

6.2. Proposition. T/zere TzoZds t/z«f 

||n^ lKl u|| L 2 (At) < 4||6|| B MO prod ( At )lhllL 2 (/.) ; »7l,«l 7^ 0. 

Proof. There holds that 



inr^iiW) ^ E E i<«>5i 2 K6,^ ® «^>r 

5eX>' R=/xJe£>good, RcS 
gen(i?)=gen(S)+(r,r) 
/■oo 

= 2/ E l^A^®^)! 2 *^ 

^° SeX>' R=IxJeT> good, RcS 

\{u) s \>t gen(i?)=gen(S)+(r,r) 

/•oo 

<2 V V KMi.m ®Mj, Kl )| 2 t^ 



5eX>' R=IxJ&V good, RcS 
Sc{M D ,u>t} gen(_R)=gen(5)+(r,r) 
/■oo 

< IIMIbmcwm 2 / > t})tdt 

= \\b\\ 2 BMO ?n M\\ M V>u\\ 2 L 2 M 

< 16 ll & llBMO prod ( Ai )ll U lli 2 (M)' 

which is what we wanted to prove. □ 

7. Mixed paraproducts 

In the last few sections, we have completed the estimation of the part of the 
Haar series expansion of (Tf, g) involving quadruples of cubes with £{Ii) < £(I 2 ) 
and i(Ji) < £{Jz). The other three subseries with one or both of these inequalities 
reversed are completely symmetric, except for one detail: in the case that the two 



30 TUOMAS HYTONEN AND HENRI MARTIKAINEN 

inequalities go in different directions, we end up with a different form of the full 
paraproduct, which we call mixed. 

The following explanation reveals how it appears. In the case ((h) < ((h) an d 
(( J\) > ((h) one needs to deal with a full paraproduct, which is different in an 
essential way: 

n££,, 6 « == E E E E <«. h ^ ® ^T7T> 

7 i c/ 2 J 2 cJi 

i{h)=2-n{i 2 ) £{j 2 )=2-n{j 1 ) 

X.I 

M^) 

rji 7^ 0, k 2 7^ 0. In the actual summation one has b = 7\(1). We need to sep- 
arately demonstrate that ||n^ d ^ b u\\ L 2 {tl) < || 6|| B MO prod (/,) IMU 2 M- Following 1)201, 
we dualize with au 6 7 2 (/i) and write 

for a suitable bilinear form A. We then need the two estimates 

\(b,f)\ < ||6||BMO pind ( A .)l|/||H 1 (A«)» \\ A ( u , v )\\m^) < \\u\\ L 2 M \\v\\ L 2 {lM) 

for a suitable (ad hoc) "H 1 " norm || • Wh 1 ^)- 

7.1. iJ^BMO type duality inequality. We denote C K := {I e £> n good : 7 C 

7T,gen(7) = gen(7sT) + r} for 7T G V' n , similarly C L C 7? m for L e V' m and C 5 := 
C K x Ci C D for S = K x L e V = V' n x P^. Also, we denote 5(7) = 7T if 7 e C*. 

7.1. Definition. We define the square function 

*W -1L L l</> ^ ® «Wl ^(5( /l) ) / , m (5(7 1 ))i 

>71^0 Jl£.i-V,good 

Ji6£> m , good 

Sev ReC s ^ K ' 

where we suppressed the 771, K\ summation and denoted h R = hi im <g> u j liKl . 

The following proposition is the main result of this section. 

7.2. Proposition. If b G BMO pro d(/x), then for all dyadic grids V n , V' n , V m and T>' m , 
and for all functions f of the form 

f = E E x Vh h h,m ® = E E A/?/ifi 

m^o /iGX> nig0Od sew Rec s 

i<6, />i <ii&iiBMo prod (,)ii< ; S:/iiL 1( ,). 
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Here we use the same suppressed notation as in the above definition. 
Proof. We begin by simply estimating 

\^f)\< E E \0>M)M. 

sew Rec s 

Let 

Xs \ 1/2 



(7-3) <t>'= S %%Zf=(X, E ' A «l 



and set := {0 > 2 fc }, and S k : = {S e V : fi(S n fi fc ) > |/x(-S r )}. Note that if 
S i U fc6 z then K$ n {</> = 0}) > and so 

E £ 2 E i A *i 2 /l(sn „|g) =0}) s 2 L, ^ 2 * = 



{?i=0} 



Therefore, all the relevant 5 in the sum of interest belong to at least one S^. Also, 
Plfccz ^fc = {0 = oo}, which has measure zero if the right side of the claim is finite 
(which we may assume). This means that if S G f] keZ S k , then fi(S) = and so 
Xr = for every R G C#. 

We infer that for all the relevant S in the sum of interest there exist a unique 
ks G Z so that 5 G <Sfc s \ Sk s +i- Thus, we may reorganize 

E E km*>a*i = e E E wmirI- 

sev ReC s k&L S£S k \s k+1 ReC s 

We then bound 

1/2 



E Ekwa*i<( E Ekm 



7I 2 



, . 5G5 fc \5 fc+ i R&Cs SeS k \S k+1 Rec s 

x ( E EM 2 f. 

SG5 fc \5 fc+ i flec s 

Clearly S E S k means that S C Cl k := {Mxn k > §}, where M is the strong dyadic 
maximal operator with respect to V. Thus 

( E Ekwi 2 ) 1/2 <(E Ekwi 2 ) 1 ' 2 

SeS k \s h+1 i?ec s scn k Rtc s 

< ||6|| B Mo prod (,)/i(^) 1/2 < IHIbmo^mM^) 172 . 
For the other factor in (|7.4[) , we use the condition that S 1 ^ <S fe+ i to write 

l<2n(S\£l k+1 )/n(S), 
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and again the condition that S E S k to see that S C Cl k , so that 

E Ei^ 2 / E Ei^i 2 T§T d " 

SeS k \S k+1 ReC s J s&s k \s k+1 ReC s ^ l J 

<2 [ <p 2 dfi<2- (2 fc+1 ) 2 /i(f4) < 2 2 V(^fc). 

Jfl k \Q k+ i 

Substituting back to (|7.4|) , we obtain 

E E KM*>A*I £ ll&llBMO prod ( M )M^) 1/2 -2V(^) 1/2 , 

5e<s fe \-s fc+ i flec s 

and hence 

E E KM*>A*| < E ll&l|BMO prod (,)2 fe /i(^) < HftllBMO^Mll^O.). □ 

7.2. Boundedness of the mixed paraproduct. We are now ready to prove: 

7.5. Proposition. There holds that ||n^ dif) w|| L2(M) < IHIbmo^mIMIiW 

Proof. This proof is adapted from ||20l . 

Let us abbreviate L := ||6||bmo Iod (M)- We wm show that 

We write 



m&w) = 0>, Y, E E E 



XJi 



7 l C/ 2 J 2C Jl 

t{h)=2~n(i 2 ) f(j 2 )=2-^(Ji) 



x X /r \ ® u J2,K 2 )h h . ni ® uj 2)K2 ). 

x U„ (Jo) / 



By Proposition 17.21 we have 

l(nr 2 ed,6«^)l<^ll^/IU 1M) 

where 

/= E E ( M » fef i.»n ® 7§77 v\ )( T, » 7^7 ^ u Jw) h hm ® 
We estimate S v ™ v 7 f pointwise. Let us first note that 

E E K«.*A«f Xsw x ' 2 

XS(Jl) _ \|2 XS(/i) ® XS(J 2 ) 



/i m (S(J 2 ))' 



x <u, 
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This implies that 



qD'n-Dm f < ( \^ 

b V n V'J ^\ 2^ 



Xs(h) 



<g) sup [\(u, hj um (g) 



XS{J 2 ) \|2 



>| Xs(.j 2 )] 



1/2 



X 



( \\i « \i 2 Wa) \ 1/2 



This yields the final pointwise bound 



E 



^1 ^ -^n,good 

x 



Xs(Ji) 
^(^(Ji)) 



®M I?ro ((«,/ iJl)J , 1 ) 1 ) : 



1/2 



( ^»«^ U J 2 .«2>2) 2 ® 



XS(J 2 ) \ 1/2 



J 2 ex>' 



Hm{,S(J 2 )) 



m.good 



Using the above pointwise bound together with Cauchy-Schwarz we get that 



dKP mf u ^ \\ ST 



Xs(/i) 



n,good 



1/2 



L1(m) 



X 



^ M^((t;, Mj2iK2 ) 2 ) 2 ® 



XS(J 2 ) 



1/2 

Ll(M)' 



??i,good 



The proof is ended by noting that 

Xs(/i) 



ri,good 



fl n (S(Ii)) 



M Vm ((u,h Il;V1 ) ] 



Li( M ) 



E 

-^1 ^^ri,good 



Mv m ((u,h hm )if dfMr 



£ / l( M ^/i,r?i)l| 2 ^m < 

/1623„ *™ 



□ 



8. Necessity of Tl e BMO prod (^) 

8.1. Dyadic Journe's lemma for general product measures. We prove Journe's 
covering lemma j|13l with general measures. In this section, we assume that [i = 
Hn x A*m is an arbitrary product of a Borel measure \i n in IR n and \i m in W n . That 
is, we don't assume any growth conditions. Again, we have the dyadic systems 
V n and V m in W 1 and IR m respectively. 

For R = I x J G V := V n x V m , write flW) := /W x jO') and 



gen^H) := gen(I), gen(.R) 



= (gen(/),gen(J)), 
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where i"W [ s the usual dyadic ancestor, and gen(7) is the usual generation of a 
dyadic cube. 

Let f2 c M n+m be such a set that < oo and that for every there exists 
ReVso that i G C fl. We let 

Q := {M X n > J}, 

where M is the strong dyadic maximal operator with respect to P. Define the 
embeddedness of R in f2 as 

embiCR; fi) := sup{A; : # (fe ' 0) C fi}. 

This notation and some related inspiration is derived from (3). 

We say that a dyadic rectangle R = IxJc£lis 2-maximal if / x J (£_ Cl 
for any dyadic J D J. (Obviously we could define emb 2 (-R; f2) and 1-maximality 
analogously. The main point is that in the following result we need to consider 
embeddedness and maximality with respect to different variables.) 

8.1. Theorem (Journe's lemma). Let uj : N — > M+ be a decreasing function with the 
property that J^kLo 00 ^) < oo. Then there holds that 

oo 

w(embi(i2;fi)) x /i(R) <2^u(k) x 

RCQ k=0 
2-maximal 

Proof. Let us define 6{k) := u(k) - u{k + 1) > 0, so that u(j) = J2T=j 5 ( k )- We 
have 

oo 

u{emb 1 {R ] n)) f ji{R) = Y i u){j) E M^) 



(8.2) 



RCfl j=0 RcQ 2-maximal 

2-maximal emb 1 ( R; Q ) = j 

oo oo k 

= E ^) = E^)E E viR), 

fc=0 Ren 2-maximal A;=0 i=0 ReTZ(k.i) 

emb 1 (R;n)<k 



where 

TZ(k, i) := |i? C Q 2-maximal; embi(i?; Q) < k, gen 1 (i?) = i mod k + 1 j. 

We make the following observation about two intersecting R = I x J and 
R! = I' x J' m TZ(k, i). By definition, we have Inl' ^ and JnJ'^0. Suppose 
first that / = I'. But then also J = J', since otherwise one of R and R' could not 
be 2-maximal. Thus I ^ I' whenever R ^ R' . Let us then consider the case that 
I C I'. Then we must have J D J', since otherwise R = IxJCIxJ'cR'c£l 
would contradict the 2-maximality of R. 

For every R — I x J E TZ(k,i), consider the subset 

E(R):=R\ |J # = Ix(j\ |J J'), 

R'=i'xJ'en(k,i) i'xJ'en(k,i) 
i'-di I'DI 
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where we observe that J' c J for all relevant J 1 in the union (i.e., those for which 
R' intersects R), by what we just checked above. The sets E(R), R, e 1Z(k, i), are 
pairwise disjoint; namely, if two different R = IxJ,R' = I'xJ' e lZ(k,i) intersect, 
the previous paragraph implies that either I' D I, in which case R' fl E(R) = 0, 
or else / D I', in which case R fl E(R') = 0. 
We claim that 

(8.3) p(E(R)) = ^(/)/i m (j\ |J J') > ^(i?) = ^/i n (/)/i m (J). 

I'xJ'eTl(k,i) 

I'DI 

This is trivial if /i„(J) = 0. Otherwise, suppose that the opposite estimate is valid. 
Observe that / C I' and gen(J) = i = gen(J') mod + 1 imply that c 
Thus 

J R( fc+1 '°) \ Q c i? (fc+1 < 0) \ |J R' = 7 (fc+1) x ( J \ |J J' 



R'=rxj'&n(k,i) rxj'&n{k,i) 

I'-DI I'OI 



Hence 



and therefore 



i'xJ'en(k,i) 

I'DI 

. e %r u(R( k+1 ^nn) , l l 

mf Mxn > V ; tt-tt^t > 1 - - = -. 



But this means that i?( fc+1 >°) c f2, contradicting embi(/2; fl) < k. 
Hence the claim (|8.3|) is valid. But this means that 

R£K(k,i) ReTZ(k,i) 

where the last step is due to the fact that the sets E(R) are pairwise disjoint and 
all contained in fl. Substituting back to (|8.2[) shows that 

oo oo 

u(embx(R;n))^(R) < J^5(A;) x (k + 1) x 2^(fi) = 2^o;(A;) x □ 

RCH k=0 k=0 

2-maximal 

8.2. A class of bi-parameter singular integrals. In this section we return to the 
setting of upper doubling measures. Let ji = fi n x /i m/ where fi n and fx m are upper 
doubling measures on W l and M m respectively. The corresponding dominating 
functions are denoted by A n and A m . We consider what seems like a slightly 
restricted class of product Calderon-Zygmund operators than above. In fact, 
the definition will be close to the classical setting of Journe, just with general 
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CZ(tl m ) ~ A / I |\ 

*n{Xl, \X\ —yi\) 



X\ — x[ | \ a 1 



measures. We want to demonstrate the necessity of the product BMO condition 
for this subclass. 

We consider product Calderon-Zygmund operators T e C(L 2 (fi)) with the fol- 
lowing vector-valued Calderon-Zygmund structure. When viewing T as an op- 
erator on L 2 (fi n ; L 2 (/i m )), it has a kernel yi) with values in L 2 (/x m )-bounded 
Calderon-Zygmund operators, such that 

($,T¥) = JJ mx 1 ),T 1 {x 1 ,y 1 )^(y 1 )) 2 dfx n {x 1 )d f x n (y 1 ) 

whenever the L 2 (/i m )-valued functions $ and \& are disjointly supported. This 
kernel is required to satisfy the standard estimates 

\\T 1 (x 1 ,y 1 M\^ l .. ,< 1 

and 

V |xi — / A„(xi, |xi - yt\ 
for |xi — yi\ > 2\x\ — x[\. (We do not require regularity in the second variable for 
the present considerations.) Here || \\cz(/i m ) designates the sum of the £(L 2 (/i m ))- 
norm of the operator, and the Calderon-Zygmund constants of its (scalar-valued) 
kernel. We impose the analogous conditions when viewing T as an operator on 
the space L 2 (/i m ; L 2 (/!„)). 

8.4. Remark. Let $ = <3>i <g> $ 2 and * = <g) \& 2 with spt$i n spt^i = 0. 
Then with our original notation we have K$ 2 ^ 2 {xi, yi) = ($2> ^1(^1, 2/1)^2)2/ 
and we have the Calderon-Zygmund bounds with the constant C($ 2 ?^ / 2) ^ 
||$2||L2( Atm )||^2||L 2 ( /im )- If also spt$ 2 n spt\l/2 = 0, then one has a representation 
with the full kernel K (x, y) = K Tl ( Xl , yi )(x2, 2/2)- We will show that for this subclass 
of operators our new non-homogeneous product BMO condition is also neces- 
sary. Although, let us note again that for example the regularity in the second 
variable is not needed in this section. We use this subclass, since it seems that 
it allows more control in situations where not all of the functions are of tensor 
product type, and this type of control is needed here for the first time. 

8.5. Theorem. Let T be an L 2 (n) bounded product Calderon-Zygmund operator in the 
sense above, and b e L°°(n). Then 

!|rfo|| B MO prod ( M ) < ||&|U°°( M )- 

8.3. Initial considerations. We may assume by homogeneity that = 1- 

By Definition 12 . 1 1 1 and Remark 12.121 we need to check the estimate 

E \(h R ,Tb)\ 2 < v(n) 

SeV ReV good, RcS 
Sett g en(_R)=gen(5)+(r,r) 

for any dyadic systems V = V n x V m and V = V' n x V' m , where Q c E n+m is 
a bounded set such that fi(£l) < 00 and that for every x G Q there exists SeV 
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so that x G S C fl We have also used the following short-hand notation: If 
R = I x J, then h R = h IxJ = hj <g> hj is shorthand for h I ri <g> for 77 7^ and 
k^O (that is, only cancellative Haar functions appear). 

We start by setting up some further simplifying notation. For K G V n , let 

C K := {/ G X? n good :ICK, gen(J) = gen(#) + r} 

and 

AT : = (J J c {a* G K : d(xi,K c ) > 2- r7 "£(K)}, 

where the last containment is immediate from the definition of goodness. We use 
similar notation C L and L for L G P^. 

We decompose b = bx& + &Xq c , where Cl := {Mp'Xn > 5} and Mp/ is the strong 
dyadic maximal operator with respect to V. Then 

E E i(^ r te)>r ^ E i(^n&xo))i 2 < r^u* 

s=KxLcnRec K xC L Rev 

< fell' <**(«);$ 

using 

fi(fi) = ^(M^xn > |) < 4||M^ X o|l2 £ llxnlll = M^) 
in the last step. 

So it suffices to prove the estimate with bx& in place of b. For simplicity we 
denote it again by b, assuming from now on that spt b C Vt c . 

For every L G Z^, let Tl be the collection of the maximal F G V' n such that 
F x L C Vt, and denote Fl := Ufs^ ^- ^ there are any cubes F e V' n with 
F x I c fi, then the maximal cubes exist by standard properties of dyadic cubes, 
except if there is a strictly increasing sequence of cubes F k G V' n with F k x L C fl. 
Since our dyadic systems do not have quadrants, such a sequence will exhaust 
all of W 1 , and then in fact R n x L C Q. In this case we set T L : = {R n }, F L : = W n . 
We refer to this as the degenerate case. 

These notions lead to the splitting b = b\ + b\, where 

b 1 L (x l ,x 2 ) := b(x 1 ,x 2 )xF L ( x i)- 

Note that if b\(yi, y 2 ) ^ 0, then y x G F L while (yi, y 2 ) G Q c C (F L x L) c , and hence 
y 2 G L c . Thus spt b\ <Z F L x L c . Note that b\ = b, b\ = in the degenerate case. 
Now we estimate the left hand side in the Theorem by 

E E Ek^ t ^)i 2 - 

i=i s=KxLeV ieC K 
sen JeC L 
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8.4. Analysis of b\. We do not need to pay any special attention to the possible 
degenerate cases in this analysis. The set F L will appear every now and then, but 
it can equally well be the full space W l or a subset thereof. The only property 
that we need is that /j, n (Fi,) < oo whenever this set appears. To see this, observe 
that b\ only appears in the pairing (h IxJ , Tb\), where J G Cl- If ^ m {L) = 0, then 
hjxj = for all J G Cl, and we can ignore such pairings. On the other hand, if 
fi m (L) > 0, then 

Hn{F L )n m {L) = n(F L x L) < fj,(Q) < oo 

implies that ^ n (F L ) < oo, as claimed. 
We begin with: 

8.6. Lemma. For a fixed L G T>' m , the sum over the other variables is estimated as 



LI? 



Kev n iec K 
JeC L 



JmrJaptbKyu-) K d(L J y 2 ) J X m (xi,d(L,y 2 )) 
where x- L is an arbitrarily fixed -point in L. 

Note that we have dropped the restriction that K x L C fi; this part of the 
estimate is true even if we allow K to range over all dyadic intervals. 

Proof. We can write 

(h Ix j,Tb{) = jj hj(x2)(h I ,T2(x2,y2)b 1 L (-,y2))idfi m (x2)dfi m (y 2 ) 

hj(x 2 )(h I , [T 2 (x 2 , y 2 ) - T 2 (cj, y 2 )\b\{-, y 2 ))i d/J, m (x 2 ) dfi m (y 2 ), 

where T 2 (x 2 , y 2 ) is the £(L 2 (/i n ))-valued kernel of T when interpreted as an op- 
erator on L 2 (/i m ; L 2 (/i n )); it acts on the function b\{-, y 2 ) G L 2 (fi n ) (the function is 
bounded and supported on F L ), and ( , ) x designates the inner product in L 2 (fi n ). 
Hence, abbreviating 

Bj(x 2 ,y 2 ) := [T 2 (x2,y 2 ) -T 2 {cj,y 2 )]b' L {-,y 2 ) G L 2 (jU„), 

we have that 

(h Ix j,Tb\) = // hj(x 2 )(h I ,Bj(x 2 ,y 2 )) 1 dii m (x 2 )dii m (y 2 ). 
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Thus, by the orthonormality of the Haar functions hi G L 2 (fi n ), we have (with a 
fixed J eC L ) that 

|2 



£ \(h Ix j,Tbl)\ 2 < £ \(hixJ,Tbl)f 



KeV n 
I&Ck 



< 



< 



1 /2 

(x 2 ) d/i m (y 



\hj(x 2 )\\\Bj(x 2 ,y 2 )\\ L 2 {fln) d/i m (x 2 ) d(i m (y 2 ) 



Since x 2 G J, we have |x 2 — cj\ < £(J)/2. Moreover, since y 2 G L c , there holds 
that 

\cj - y 3 | > d(J,L c ) > t{jy m l{Lf-^ m > e(J) > 2\x 2 - cj\. 
Hence the Holder estimate for T 2 gives that 

\\T 2 {x 2 ,y 2 ) - T 2 (cj,y 2 ))\\ c[L 2^ n)) < 

It follows that 



< 



cj-y 2 \y A m (cj, \cj-y 2 \) 



E i<^xj,T6i: 



< 



*(./) ^ ll&L(-,i&)IU»<*o , , , A ( 

dfJ, m [y 2 ) dn m {x 2 



JgC l 



Cj-J/2K A m (cj, |cj - y 2 |) 



d/i m (y 2 ) 



|cj-y 2 |/ A m (cj, \cj - y 2 | 

By Cauchy-Schwarz, the last brackets squared are dominated by 

£(J) \/3 1 



jc\\cj-y 2 \J X m (cj,\cj -y 2 \) 



dfi m (y 2 ) 



£(j) y \\K(;y2)\\ 2 L 2 ifln) 



cj-y 2 \/ A m (cj, \cj - y 2 \) 



dn m (y 2 ) 



It is a standard fact that 



£{J) Y 



d^ m {y 2 ) < l. 



|cj-2/2K A m (cj, |cj - 2/2 1 

To estimate the other bracket we need a few observations. Since cj G J G 
we have that cj <E L and so |cj — y 2 | > ^(-^> y 2 ) • So we may estimate 

£(L) \/3 1 



£(J) Y 



< 



cj-y 2 \J X m (cj,\cj-y 2 \) \d(L,y 2 )' X m (cj,d(L,y 2 )) 
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We still want to get rid of the last dependence on J, the cj on A m (cj, d(L, y 2 )). 
To this end, let %i be an arbitrarily fixed point in L as in the statement of the 
Lemma. Since cj, %i G L, we have |cj — %i\ < i(L). Moreover, we have d(L, y 2 ) > 
d(L, L c ) > 2~ Tlm i(L) > 2~ Tlm \cj — x- L \. Therefore, we can estimate that 

\ m (cj,d(L,y 2 )) ~ X rn (cj,2 r ^d(L,y 2 )) ~ X m (x L , 2 r ^d(L, y 2 )) ~ X m (x L ,d(L,y 2 )). 

So for the other bracket we get the bound 

d/i m (7/ 2 ) 

d/i m (y 2 ) 



cj-y 2 \y X m (cj,\cj - y 2 \) 



< 



d{L,y 2 ) J X m (x L ,d(L,y 2 )) 

£(L) \f> d^ m {y 2 ) 



< 

l ap tbi( yi ,.) v d(L,y 2 ) y X m (x L ,d(L,y 2 )) 
Since these bounds do not depend on J, the proof is completed by observing that 

It remains to sum the bound of the Lemma over all L G V' m . (Note that F L = 
= spt b\ if there is no dyadic interval F such that F x L c (l.) Thus we need to 
estimate 



2 



KxLeV l£C K 

KxLcn Jec L 



(8.7) < £ ^(^) / / ( 



£(L) ^ dfi m (y 2 ) 



bi(.vw) K d(L,y 2 )' X m (x L ,d(L,y 2 )) 



L-.LBX2 



spt ( 



bi(2/i,-) K d(L,y 2 )s X m (x L ,d(L,y 2 )) 



where the last equality follows by writing fi m (L) = J Xl( x z) ^m(x 2 ) and reorga- 
nizing. 

Note that the outermost double integral may be restricted to Cl. Indeed, let 
x 2 G L and spt b\(yi, •) be nontrivial. Then necessarily y x G F L , and thus (yi, x 2 ) G 
F L x L C Cl. 

For each fixed (yi, x 2 ) G f2, let T = T(y±, x 2 ) denote the maximal L G V m ap- 
pearing in ((821 with the property that x 2 G 1/ and spt b\(yi, •) 7^ 0, so in particular 
yi G Fi. The existence of these maximal cubes L can be seen as follows: Since 
Vt is bounded, we can obviously add the implicit restriction £(L) < diam(f2) to 
the summation on the first line, and then all other lines, in (|8.7[). Thus the size of 
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the relevant cubes L is bounded from above, and then the existence of maximal 
cubes is a standard property of dyadic grids. 

By definition, we have F T x T C Cl and y\ G F T . So in particular {y^ x T C Q. 
Since spt b\ C VL C , it follows that spt b h (yi, •) C T c , so any ?/ 2 in the integral satisfies 
2/2 espt6i,(j/i,-) Cf, 

If z G L, then there is a good cube J G X> m so that z E J C L c T and 
£( J) = 2" r £(L) < 2~ r l(T). The goodness implies that 

\z - y 2 \ > d(J,T c ) > i{Jf m i{T) l -^ m = 2- r " /m £{Ly m £(T) 1 -~ /m . 

We may conclude that every y 2 in the integral satisfies the bound d(L, y 2 ) > 
2- r ~ 1m i(Ly™£(T) l -~ 1m . So we may estimate 

d(L,y 2 )~ p dn m (y 2 ) 
pt^dft,.) \ m (x L ,d(L,y 2 )) 

d(L,y 2 )-Pdfi m (y 2 ) 



< V 

• =0 i{y 2 :2J2-''7m£(Z / )7m£(T)i-7m<(i(L,3/2)<2J+ 1 2-'"T'™£(L)T'm£(T) 1 -T'm} A m (££, d(L, t/ 2 )) 



fj, m (B(x t , [2 + 2 r7 ™] • 2i2- r ^e(LY'™£(T) 1 -"'™)) 
A m (a^, 2J'2-^ m £(L)7™£(T) 1 ^) 



: E(2 i 2- r7m £(L) 7m £(T) 1 ~ 7m 

3=0 

< £(L)~ /37m £(T)~ /3( - 1_7m - ) 
This gives us the bound 



^tiiwMU)^ A m (^(L,y 2 )) -^^V^j 



oc 



= ^2 _/3(1_7mW < I. 

j=0 

Substituting back to (|8.7[) and recalling that the outer double integral is re- 
stricted to fi, we deduce that 



KxLcQIeC K 
J&C L 



E El(^^)l 2 < // t dMi& ) * 2 ) = Ai(n)< A i(n), 

as required. 



8.5. Analysis of 6|. Here we sum the multiple series in a different order: 
8.8. Lemma. For a fixed K G X^, sum oyer f/ze of/zer variables is estimated as 



E Ek^,t6|>i 2 < E^ xG )(I^ 



KxLcQ J^c L 



a(l-7») 
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where Q K consists of the maximal G e V' m such that K x G C Q, and K G e V' n is the 
maximal cube such that K G d K and K G x G C f2. 

Before going to the proof, let us comment on the existence of the maximal cubes 
in the statement of the Lemma. Since Vt is bounded, there is an upper bound for 
£(G) such that K x G C Vt, so Qk is immediately well defined. As for K G , there 
again arises the possibility of an infinite increasing sequence of Kk 3 K with 
Kk x G C Cl. As before, such a sequence will exhaust IR n , thus W n x G C (l, 
and accordingly we interpret Kg := W l , i(K G ) := oo in this case. Note that the 
corresponding terms then vanish on the right of the asserted estimate. 

Proof. Clearly every L as in the left is contained in a unique G &Q K ; thus 



In this sum, we observe that K G x L c K G x G C f2, and hence by definition 
A' G c F L . Now, for every G e fo, there are two possibilities: 

• There is at least one L c G such that F L ^ M. n . Then J<" G c F L is a proper 
dyadic cube. 

• For all L c G, we have Fx, = MJ 1 . But then for all these L, we are in the 
degenerate case with b\ = 0. Hence the part of the sum corresponding to 
these cubes G will vanish, and can be ignored. 

Thus, we can restrict the summation to the cubes G G Qk for which K G e T>' n is a 
proper dyadic cube. 

Now, for summation variables as above, we have that / C K C K C Ag and 
F£ D K G are disjoint, and we may use the partial kernel representation 



£ ^|(/, 7xJ ,T6l)| 2 = £ £ £ £|</> /xj! T6i>| 2 . 





^/(>l)(/ij,7l(£i,?/i)%i, -))2XF£(yi) d// n (xi) d/i n (?/i) 



(^j, yi)) 2 XFi (yi) d/i„(xi) d/i„(j/i), 



where 



We have 



cj- yi \> d(I, F C L ) > d(I, K c ) > £(iy-£(K) 



x - 7 " > £(I) > 2\ Xl - a 



and therefore we may estimate 
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Since b(y 1 , •) e L°°(/x m ) is mapped into BMOg(// m ) by the Calderon-Zygmund 
operator above, we also have 

Plfcl.J/OllBMOlOim) £ H^l (Xl, -Ti(c/, ||cZ( Mro ) 

£(/) r 1 



Estimating XFg(yi) < Xk g (Vi)> we then deduce that with a fixed I E C K there 
holds 



#M z i)l[ Y KRc G (yi) Y \{hj,B I (x 1 ,y 1 )) 2 \ 2 ' d/i„(xi)d/i n (yi; 
n*-r> t r-n 



Geg K LcG 
JeC L 



< 



We then note that 



GcQ K LcG 
J&c L 



Y \(hjiB I (x 1 ,y 1 )) 2 \ 2 < Y \(hj,B I (x 1 ,y 1 )) 2 \ 

LcG J good 

JgCl JCG 

£(J)<2- r e{G) 

< VmWWB^y^Wl^^ 

by Lemma [2.10i This gives us that 



Y Y\^ Th D\ 2 

LcG 

It Mzi)|||£/(zi,yi)|| B MO§( Mm )[ Y V™{G)xic G (yi) ' dfinixi) dfi n (y 

<»n(i)( I (t^-iYt^ 1 J Y »™{G)xK h {yi)\ ir ' V(yi)) 5 

W v|cj-i/i|/ A n (c/, |cj - 2/1 1 ) L J / 



GcQk LcG 
JeC L 



< 



Geg K 



The integral squared is estimated by 

£(i) y d Vn(yi) 
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which is further dominated by 



5^ ^m(G) 



< 



ci-y\\' K(ci,\ci -yi\) 

E mg) f cMl,l) 



yi) J A n (x^,d(if,2/i)) 

\«(l"7n) 



These estimates follow completely analogously to the ones in the previous sub- 
section and utilize goodness in an essential way. 
We have established the bound 

E E E w>™,t®\' z E "»(/) E mo^)" 1 '" 7 "' 

I&Ck G&Gk LcG I&C k G£G k l \ n G) 



Jec L 



, N / £(K) \«(i-tw) 

which ends the proof of the lemma. □ 
It remains to estimate the right side of the Lemma summed over K e V' n : 

K&T>' n G<=g K £{Kg) kxgcq, 

2-maximal 

oo 

< 2 ^V a(1 " 7 ' l)fc x fi(Q) < //(ft). 

Here we used that by the definition of Qk, the rectangles K x G C ft are 2- 
maximal. Second, by the definition of i^c we have 

_ n-embi (-F£TxG;n) 

Lastly, we utilized Theorem 18.11 This completes the proof of Theorem 18.51 
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